Generating Sentences by Editing Prototypes

نویسندگان

  • Kelvin Guu
  • Tatsunori B. Hashimoto
  • Yonatan Oren
  • Percy Liang
چکیده

We propose a new generative model of sentences that first samples a prototype sentence from the training corpus and then edits it into a new sentence. Compared to traditional models that generate from scratch either left-toright or by first sampling a latent sentence vector, our prototype-then-edit model improves perplexity on language modeling and generates higher quality outputs according to human evaluation. Furthermore, the model gives rise to a latent edit vector that captures interpretable semantics such as sentence similarity and sentence-level analogies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Weighting Prototypes . A New Editing Approach

It is well known that editing techniques can be applied to (large) sets of prototypes in order to bring the error rate of the Nearest Neighbour classifier close to the optimal Bayes risk. However, in practice, the behaviour of these techniques uses to be much worse than expected from the asymp-totic predictions. A novel editing technique is introduced here which explicitly aims at obtaining a g...

متن کامل

Weighting Prototypes. A New Editing Approach

It is well known that editing techniques can be applied to (large) sets of prototypes in order to bring the error rate of the Nearest Neighbour classifier close to the optimal Bayes risk. However, in practice, the behaviour of these techniques uses to be much worse than expected from the asymp-totic predictions. A novel editing technique is introduced here which explicitly aims at obtaining a g...

متن کامل

Monolingual Post-Editing by a Domain Expert is Highly Effective for Translation Triage

Various small-scale pilot studies have found that for at least some documents, monolingual target language speakers may be able to successfully post-edit machine translations. We begin by analyzing previously published post-editing data to ascertain the effect, if any, of original source language on post-editing quality. Schwartz et al. (2014) hypothesized that post-editing success may be more ...

متن کامل

Editing Prototypes in the Finite Sample Size Case Using Alternative Neighbourhoods

The recently intro(hwed concept of Nearest Centroid Neight)orhood is applied to discard outlirrs and prototypes in cl,~s overlapping regions in order to improve the performance of the Nearest Neighbor rule through an etliting i)rocedure. This apl)roach is related to graph b~sed editing algorithms which also define alternatiw, neighborhoods in t[,rms of geometric relations. Cl,~si('al e([iting a...

متن کامل

Advancing Chimeric Antigen Receptor-Engineered T-Cell Immunotherapy Using Genome Editing Technologies: Challenges and Future Prospects

Chimeric antigen receptor engineered-T (CAR-T) cells also named as living drugs, have been recently known as a breakthrough technology and were applied as an adoptive immunotherapy against different types of cancer. They also attracted widespread interest because of the success of B-cell malignancy therapy achieved by anti-CD19 CAR-T cells. Current genetic toolbox enabled the synthesis of CARs ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1709.08878  شماره 

صفحات  -

تاریخ انتشار 2017